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Method of executing a computer program with an interpreter, computer system and computer 
program product. 



The invention relates to a method of executing a program with a computer 
system. The invention also relates to a computer system and a computer program product. 

The Java computer language uses an execution model that involves a 
definition of a virtual machine and programs of instructions, also called byte codes, in a 
5 virtual machine language that is supported by the virtual machine. The Java program may be 
s=t executed with a machine that can only execute instructions from its own instruction set, so- 

r f called native instructions, which are not identical to the virtual machine instructions. A 

l=* typical implementation of the virtual machine involves an interpreter, which is implemented 

5 J for example as a program running on a computer system. The interpreter reads the virtual 

y { 10 machine instructions. The interpreter selects native instructions that implement these virtual 
» machine instructions and finally the interpreter causes the execution of these native 

'Q\ instructions 

^ In the Java language a virtual machine instruction may contain a symbolic 

p reference. Such a symbolic reference refers to the name (or type or structure etc.) of an 

|S= " 15 operand, rather than to the actual value. In case of the Java language, a symbolic reference 

refers to an object in a so-called "constant pool". The object in turn may refer to other objects 
and so on. When the interpreter executes an instruction with a symbolic reference, this 
symbolic reference is first "resolved": the references are followed and translated until a result 
is produced that can be used as a practical input for the execution by the interpreter. If 
20 necessary, as part of resolving a reference, objects such as variables or classes that are 
referred to are loaded or prepared in another way (e.g. initialized) to realize a practical 
operand. Furthermore, a search is made for definitions of classes, operand size is determined, 
access privilege is checked etc. During the execution of a later virtual machine instruction in 
the program, the same symbolic reference may again be encountered. In principle, this means 
25 that part or all the work for the symbolic reference will be executed again. This work takes a 
considerable part of execution time. 

From US patent No. 5,367,685 it is known to modify a virtual machine 
instruction at run-time after resolution by storing the practical result of resolution in the 
virtual machine instruction as a replacement of the symbolic reference. Usually the virtual 
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machine instruction itself is replaced, to account for the altered type of operand. Thus, it 
won't be necessary to resolve the symbolic reference more than once if the same virtual 
machine instruction (from the same place in the program) is executed again. This does not 
exclude a need for repeated resolution if other instructions at different locations in the 
program contain the same symbolic reference. 

It is an object of the invention to reduce the amount of time needed for 
resolving symbolic references during execution of instructions that contain such symbolic 
references. 



According to the invention, groups of memory locations are provided, each 
supporting a storage function for storing a result of resolving a symbolic references. Different 
groups of memory locations are provided for different categories of instruction types (a 
category may consist of only one type of instruction; different instructions of the same type 

1 5 have the same op-code for example). When an instruction of a particular category with a 
symbolic reference is encountered, only the group of memory locations for that particular 
category is consulted to determine whether a result of resolving the symbolic reference has 
been stored. If it is not stored, the symbolic reference is resolved and the result is stored in 
the group of memory locations provided for the particular category. In case of the Java 

20 language for example, different instruction categories are distinguished for M get_field M , 

"invoke_method n and "new" virtual machine instructions etc. In principle instructions in the 
same category differ only in the operands they refer to and concern the same operation, but it 
is also envisaged that instructions that concern different but related operation may belong to 
the same category. 

25 The use of memory locations for storing the results of resolution in 

combination with the symbolic reference involved speeds up execution even if a single 
memory is used for all types of instruction. The speed is improved further by using different 
specialized groups of memory locations for symbolic references in different categories of 
instructions. This has the advantage that the number of symbolic references per group is 

30 smaller than the overall number of symbolic references. This reduces the time needed to 
determine whether a symbolic reference is present in the memory locations, at little or no 
additional cost because the type of operation of the instruction (from which the category 
follows) has to be determined anyway during interpretation. Of course, this implies that, if 
the same symbolic reference were to be used in instructions of different categories, the 
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symbolic reference may have to be resolved repeatedly, once for each category. However, it 
has been found that this loss is small or non-existent because the same reference is normally 
used only in only one or a very limited number of categories. 

Moreover, the result of resolution can be specialized for the category of 
5 instruction. If different practical results of resolution of the same symbolic reference are 
needed for instructions of different categories, the practical result needed for instructions of 
that category may be stored. If no distinction per category was made, one could only store a 
less resolved result that could serve as a basis for resolving the practical category for all 
categories of instructions. 

10 

These and other advantageous aspects of the method and computer system 
according to the invention will be described in more detail using of the following figures: 

Figure 1 shows a computer architecture; 
^ I 15 Figure 2 shows a flow-chart for executing a program. 

Figure 1 shows a computer architecture comprising a processor 10, an 
M interpreter 12, an instruction memory 14, a data memory 16 and a resolution result memory 

ri 18. The resolution result memory 18 is structured into sub-memories 180, 182, 184, 186. The 

1=5 20 interpreter is coupled to processor 10, instruction memory 14, data memory 16 and resolution 
result memory 18. It is to be noted that figure 1 shows an architecture rather than any specific 
circuit. This means that the individual component shown can be implemented in various 
ways. One way of implementing the components is to use different circuits for the processor 
10, instruction memory 14, data memory 16 and sub-memories 180, 182, 184, 186 of 
25 resolution result memory 18. However, the implementation of functions of the components 
may be combined into shared units. For example, different memory regions of a common 
memory may be used for instruction memory 14, data memory 16 and the sub-memories 180, 
182, 184, 1 86 of resolution result memory 18. These regions need not be continuous regions: 
memory locations for instruction memory 14, data memory 16 and the sub-memories 180, 
30 1 82, 1 84, 1 86 of resolution result memory 1 8 may be interleaved in any way if necessary. As 
another example, interpreter 12 may be implemented as a program stored in a memory and 
run on processor 10 or the interpreter may be a preprocessor that translates instructions for 
processor 10. As a further example, processor 10 and interpreter 12 may be part of a single 
processor capable of executing virtual machine instructions directly, i.e. without translation 
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into native machine instructions. In this case native instructions and machine instructions are 
synonymous. As another example, any combination of such implementations may be used. 

By way of illustration the operation of the computer architecture will be 
described in terms of interpretation of a program of JAVA virtual machine instructions, 
5 although the invention is not limited to JAVA programs. Each JAVA virtual machine 
instruction (also called byte code) identifies an operation to be performed. Certain JAVA 
virtual machine instructions also define an operand for use in the operation, the operand 
being defined as an explicit number or by means of a reference to the so-called "constant 
pool", which is a data structure stored in data memory 16 during execution of the JAVA 
10 program. Examples of JAVA virtual machine instructions that contain a reference to the 

constant pool are the instructions called anewarray, checkcast, getfield, getstatic, instanceof, 
invokeinterface, invokespecial, invokestatic, invokevirtual, ldc, ldc w, ldc2_w, 
multianewarray, new, putfield, putstatic. The reference to the constant pool refers for 
example to an item in the constant pool which in turn refers to a class name. As another 
1 5 example the reference may refer to an item that in turn contains a first and second reference, 
the first reference referring to another item that defines a class name, the second reference 
referring to another item that defines a field name. 

Interpretation of such a virtual machine instruction involves "resolution" of the 
reference. In case of the "getfield" virtual machine instruction the reference to the constant 
20 pool ultimately has to be converted into an address for addressing memory locations that 

store the content of a field of a class. To realize this, resolution involves looking up the class 
involved and the location of the field in the class. 

In case of the "new" instruction, the reference to the constant pool has to be 
converted to an address of a description of an object that is to be created. 
25 Figure 2 shows a flow-chart illustrating the operation of the computer 

architecture of figure 1. In a first step 21 interpreter 12 loads a virtual machine instruction 
from instruction memory 14. In a second step 22, interpreter 12 determines the category type 
of the virtual machine instruction. The second step 22 selects, dependent on the type of 
instruction (or the category to which this type belongs), which one of a number of alternative 
30 third steps 23a, 23b, 23c is executed. Only one of these alternative third steps 23a is shown in 
more detail. 

The third steps 23a,b,c make use of resolution result memory 18. In a first sub- 
step 230 of the third step 23, the interpreter 12 takes the constant pool reference contained in 
the virtual machine instruction that is being interpreted and searches one of the sub-memories 
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180 5 182, 184, 186 to determine whether a result of resolving the constant pool reference is 
stored in that sub-memory 180, 182, 184, 186. The interpreter 12 selects the sub-memory 
180, 182, 184, 186 dependent on the type (category) of virtual machine instruction that is 
being interpreted. For example, in case of a "getfield" or "putfield" instruction, which belong 
5 to the same category, a first one 180 of the sub-memories is searched and in case of a "new" 
instruction a second one 1 82 of the sub-memories is searched. 

In a second sub-step 232 the interpreter 12 determines the further course of 
action dependent on whether or not a resolution result was found in the first sub-step 230. If a 
resolution result was found, the interpreter 12 immediately executes a fourth sub-step 234 
10 which causes the processor 10 to execute one or more native instructions that implement the 
virtual machine instruction that is being executed. During execution by the processor 10 the 
j=j result of resolution is fed to the processor 10 as an operand. It should be noted that the 

M resolution result can be made up of a set of numbers, for example an address of a variable 

m and a length code describing the number of words that make up the variable. 

\f\ 15 If no resolution result is found in second sub-step 232, the interpreter 12 

executes fifth and sixth substeps 235, 236. In fifth substep 235 the interpreter 12 resolves the 
Ta constant pool reference by loading information from the object referred to by the reference 

^ from data memory 16, and if necessary by loading data referred to by that information and so 

ii) on. Eventually, this results in a resolution result that can be used in the fourth substep 234. In 

|S=S 20 the sixth substep 236 the interpreter 12 selects one of the sub-memories 180, 182, 184, 186 
dependent on the type of the instruction that is being interpreted and the interpreter 12 stores 
the resolution result in that selected sub-memory 180, 182, 184, 186 in combination with the 
constant pool reference from the instruction. Subsequently, interpreter 12 proceeds with the 
fourth sub-step 234 as in the case when the resolution result was retrieved from the sub- 
25 memory 180, 182, 184, 186. 

After completion of the fourth substep 234, the third step 23a is complete and 
the interpreter 12 steps to the next virtual machine instruction in a fourth step 24 of the flow- 
chart. After that the interpreter 12 repeats the process from the first step 21 . 

The alternatives 23a, 23b, 23c for the third step of the flow-chart all have 
30 similar structure, but each uses a different one of the submemories 180, 182, 184, 186 of the 
resolution result memory 18, so that the submemories 180, 182, 184, 186 are used for 
different categories of instructions. In general, the type of resolution result produced by the 
equivalents of fifth substep 235 in the alternatives 23a, 23b, 23c of the third step will also be 
different in different alternative steps 23a, 23b, 23c. Preferably, the resolution result is 
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worked out in the fifth substep 235 to the point of execution of a virtual machine instruction 
where the result is the same for all executions of the virtual machine instructions of the type 
for which the alternative step 23a, 23b, 23c is allocated. This worked out result is stored in 
the submemory 180, 182, 184, 186 in the sixth substep 236. However, without deviating from 
5 the invention, a result may be stored in the submemory 180, 182, 184, 186 for example to 
save memory space. In this case some of the working out of the result is moved to the fourth 
substep 234. 

In the sub-memories 180, 182, 184, 186 the resolution result may be stored in 
combination with the constant pool reference in various ways. Basically, each sub-memory 
10 180, 182, 184, 186 must perform an associative memory function, allowing the retrieval of a 
resolution result on the basis of a constant pool reference. 

This may be realized in many ways that are known per se. In one embodiment 
a binary search tree structure is used. In this embodiment one uses pairs of a constant pool 
reference and a resolution result. Each pair is assigned for example to a node or leaf in the 
1 5 tree structure. Preferably, the tree is structured so that two approximately equal sized sub- 
trees are appended to every node. The sub-trees appended to any particular node end in 
leaves with constant pool references that are bigger and not bigger than a threshold value for 
the particular node for one sub-tree and the other respectively. Typically the threshold value 
is one of the constant pool references that is actually used. 
20 Thus, a required constant pool reference can be searched for by starting at a 

root node of the tree, comparing the reference with the threshold of the node and selecting the 
sub-tree of that node that may contain the reference on the basis of that comparison and so on 
until one arrives at a leaf node. A tree structure is implemented for example by representing 
each node with a stored data-structure that contains the threshold and pointers to the data- 
25 structures of further nodes that form the roots of the subtrees appended to the node. It will be 
noted that in this case memory locations from submemories 1 80, 1 82, 1 84, 1 86 are 
distinguished only by storing (constant pool reference resolution, result) pairs belonging to 
different trees, and may occur interspersed with memory locations from other trees. Of 
course, to speed up searching a pair may also be associated with a node of a tree, the constant 
30 pool reference of the pair serving as threshold for the node. 

In principle, a respective sub-memory 180, 182, 184, 186 and alternative third 
step 23a-c may be provided for every possible virtual machine instruction from the 
instruction set which uses an object-reference. In the case of JAVA virtual machine 
instructions a respective sub-memory 180, 182, 184, 186 and alternative third step 23a-c may 
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be provided for the anewarray, checkcast, getfield, getstatic, instanceof, invokeinterface, 
invokespecial, invokestatic, invokevirtual, ldc, idc_w, ldc2_w, multianewarray, new, 
putfield, pustatic instructions. However, instructions from some groups of these virtual 
machine instructions, such as a group containing putfield and getfield, require identical 
5 constant pool resolution. Also, the same constant pool reference may be used in instances of 
all of the instructions from the same group. Therefore a common sub-memory 180, 182, 184, 
186 and alternative third step 23a-c is preferably provided for such a group. The distinction 
between the different instructions in the group is made only in the fourth sub-step 234. 



10 are a particular JAVA virtual machine instruction and its "quick" variants. JAVA has for 

!SS% example the following quick variants: ldcquick, ldc_w_quick, ldc2_w_quick, 

II getfield_quick, putfield_quick, getfield2_quick, putfield2_quick, getstatic_quick, 

Q putstatic_quick, getstatic2_quick, putstatic2_quick, invoke virtual_quick, 

l Zl invokenonvirtual_quick, invokesuperquick, invokestatic_quick, invokeinterface_quick, 

W 15 invoke virtualobject_quick, new_quick, anewarrayquick, multianewarray_quick, 



of a quick variant does not require resolution or search for a reference that has already been 



20 resolved. For such quick variants, therefore, no search is executed. Preferably, once the 

object reference of an instruction has been resolved this instruction is replaced in memory by 
its quick variant with the result of resolution as operand (of course this is only possible if the 
program memory is not a ROM). In addition the result of resolution is stored in one of the 
sub-memories. Thus, if the instruction from a certain memory location is executed again it 

25 will be executed as a quick variant without consulting the sub-memory, but if the same type 
of instruction with the same operand from a different memory location, the result of 
resolution from the sub-memory is used as operand. 



more information about the resolution in result than other instructions from the group. 
30 Preferably, the maximum amount of information needed for any member of the group is 

stored in the sub-memory 180, 182, 184, 186, irrespective of whether all of this information 
is used for the particular instruction for which resolution was performed. Thus all members 
of the group can use the resolution result from the sub-memory 180, 182, 184, 186. 



Other instructions from the instruction set that may be placed in such a group 



checkcast_quick, instanceof_quick, invokevirtual_quick_w, getfield_quick_w, 
putfield_quick_w. 

Quick variants can also use resolved data as operand(s). In this case, execution 



In some cases some instructions from a group of such instructions may need 
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Without deviating from the invention the interpreter 12 may use a step 



common for all instructions to search for the resolution result from the resolution result 
memory 18, using the instruction type as an index to select a particular sub-memory 180, 
182, 184, 186. 



5 



In principle all resolution results produced during execution of the program 



may be kept in resolution result memory 1 8 once they have been found until completion of 
the program, or until completion of a program section to which these results are local. In case 
it is desirable to reduce memory usage it may be considered to replace a resolution result for 
one constant pool reference by another resolution result for a different constant pool 

10 reference, for example replacing the least recently used resolution result. This may mean that 
the replaced result may have to be resolved again, but it reduces the amount of needed 

I resolution result memory, or at least it may provide a guaranteed maximum memory use. 



